Epidemics and percolation in small-world networks 
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We study some simple models of disease transmission on small-world networks, in which either 
the probability of infection by a disease or the probability of its transmission is varied, or both. 
The resulting models display epidemic behavior when the infection or transmission probability rises 
above the threshold for site or bond percolation on the network, and we give exact solutions for 
the position of this threshold in a variety of cases. We confirm our analytic results by numerical 
simulation. 
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I. INTRODUCTION 

It has long been recognized that the structure of social 
networks plays an important role in the dynamics of dis- 
ease propagation. Networks showing the "small-world" 
effect where the number of "degrees of separation" 
between any two members of a given population is small 
by comparison with the size of the population itself, show 
much faster disease propagation than, for instance, sim- 
ple diffusion models on regular lattices. 

Milgram |^ was one of the first to point out the exis- 
tence of small-world effects in real populations. He per- 
formed experiments which suggested that there are only 
about six intermediate acquaintances separating any two 
people on the planet, which in turn suggests that a highly 
infectious disease could spread to all six billion people on 
the planet in only about six incubation periods of the 
disease. 

Early models of this phenomenon were based on ran- 
dom graphs [^j^. However, random graphs lack some 
of the crucial properties of real social networks. In par- 
ticular, social networks show "clustering," in which the 
probability of two people knowing one another is greatly 
increased if they have a common acquaintance |q]. In 
a random graph, by contrast, the probability of there 
being a connection between any two people is uniform, 
regardless of which two you choose. 

Watts and Strogatz |^ have recently suggested a new 
"small-world model" which has this clustering property, 
and has only a small average number of degrees of sepa- 
ration between any two individuals. In this paper we use 
a variant of the Watts-Strogatz model to investigate 
disease propagation. In this variant, the population lives 
on a low-dimensional lattice (usually a one-dimensional 
one) where each site is connected to a small number of 
neighboring sites. A low density of "shortcuts" is then 
added between randomly chosen pairs of sites, producing 
much shorter typical separations, while preserving the 
clustering property of the regular lattice. 

Newman and Watts gave a simple differential equa- 
tion model for disease propagation on an infinite small- 
world graph in which communication of the disease takes 
place with 100% efficiency — all acquaintances of an in- 
fected person become infected at the next time-step. 



They solved this model for the one-dimensional small- 
world graph, and the solution was later generalized to 
higher dimensions |^ and to finite-sized lattices In- 
fection with 100% efficiency is not a particularly realistic 
model however, even for spectacularly virulent diseases 
like Ebola fever, so Newman and Watts also suggested 
using a site percolation model for disease spreading in 
which some fraction p of the population are considered 
susceptible to the disease, and an initial outbreak can 
spread only as far as the limits of the connected cluster 
of susceptible individuals in which it first strikes. An 
epidemic can occur if the system is at or above its per- 
colation threshold where the size of the largest cluster 
becomes comparable with the size of the entire popula- 
tion. Newman and Watts gave an approximate solution 
for the fraction pc of susceptible individuals at this epi- 
demic point, as a function of the density of shortcuts on 
the lattice. In this paper we derive an exact solution, and 
also look at the case in which transmission between indi- 
viduals takes place with less than 100% efficiency, which 
can be modeled as a bond percolation process. 



II. SITE PERCOLATION 

Two simple parameters of interest in epidemiology are 
susceptibility, the probability that an individual exposed 
to a disease will contract it, and transmissihility, the 
probability that contact between an infected individual 
and a healthy but susceptible one will result in the latter 
contracting the disease. In this paper, we assume that a 
disease begins with a single infected individual. Individ- 
uals are represented by the sites of a small-world model 
and the disease spreads along the bonds, which represent 
contacts between individuals. We denote the sites as be- 
ing occupied or not depending on whether an individual 
is susceptible to the disease, and the bonds as being occu- 
pied or not depending on whether a contact will transmit 
the disease to a susceptible individual. If the distribution 
of occupied sites or bonds is random, then the problem 
of when an epidemic takes place becomes equivalent to a 
standard percolation problem on the small-world graph: 
what fraction pc of sites or bonds must be occupied before 
a "giant component" of connected sites forms whose size 
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FIG. 1. A small-world graph with L = 24, k = 1, and four 
shortcuts. The colored sites represented susceptible individ- 
uals. The susceptibility is p = | in this example. 

scales extensively with the total number L of sites ||T^ ? 

We will start with the site percolation case, in which 
every contact of a healthy but susceptible person with 
an infected person results in transmission, but less than 
100% of the individuals are susceptible. The fraction p 
of occupied sites is precisely the susceptibility defined 
above. 

Consider a one-dimensional small-world graph as in 
Fig. 1^. L sites are arranged on a one-dimensional lattice 
with periodic boundary conditions and bonds connect all 
pairs of sites which are separated by a distance of k or 
less. (For simplicity we have chosen A; = 1 in the figure.) 
Shortcuts are now added between randomly chosen pairs 
of sites. It is standard to define the parameter cj) to be the 
average number of shortcuts per bond on the underlying 
lattice. The probability that two randomly chosen sites 
have a shortcut between them is then 
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for large L. 

A connected cluster on the small-world graph con- 
sists of a number of local clusters — occupied sites which 
are connected together by the near-neighbor bonds on 
the underlying one-dimensional lattice — which are them- 
selves connected together by shortcuts. For k — 1, the 
average number of local clusters of length i is 



TV, - (1 - p) Vi- 

For general k we have 

= {l-qfpq'-'L, 



(2) 
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where q — 1 — {1 ~ p)'^. 

Now we build a connected cluster out of these local 
clusters as follows. We start with one particular local 
cluster, and first add to it all other local clusters which 



can be reached by traveling along a single shortcut. Then 
we add all local clusters which can be reached from those 
new ones by traveling along a single shortcut, and so forth 
until the connected cluster is complete. Let us define a 
vector V at each step in this process, whose components 
Vi are equal to the probability that a local cluster of size i 
has just been added to the overall connected cluster. We 
wish to calculate the value v' of this vector in terms of 
its value v at the previous step. At or below the percola- 
tion threshold the Vi are small and so the v'^ will depend 
linearly on the Vi according to a transition matrix M 
thus: 



where 



My = iv,(i-(i-7/^r^). 
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Here Ni is the number of local clusters of size i as before, 
and 1 — (1 — ipy^ is the probability of a shortcut from a 
local cluster of size i to one of size j, since there are ij 
possible pairs of sites by which these can be connected. 
Note that M is not a stochastic matrix, i.e., it is not 
normalized so that its rows sum to unity. 

Now consider the largest eigenvalue A of M. If A < 1, 
iterating Eq. makes v tend to zero, so that the rate at 
which new local clusters are added falls off exponentially, 
and the connected clusters are finite with an exponential 
size distribution. Conversely, if A > 1, v grows until 
the size of the cluster becomes limited by the size of the 
whole system. The percolation threshold occurs at the 
point A = 1. 

For finite L finding the largest eigenvalue of M is dif- 
ficult. However, if (j) is held constant, tp tends to zero as 
L — > oo, so for large L we can approximate M as 



(6) 



This matrix is the outer product of two vectors, with the 
result that Eq. (|) can be simplified to 



Awj = itpN^y^Jvj, 



(7) 



where we have set v'^ — Xvi . Thus the eigenvectors of M 
have the form Vi = CX^^itpNi where C = J2j j'^j ^ 
constant. Eliminating C then gives 

A-0^/7V,. 
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For fc = 1, this gives 

A = %l)Lp — '—^ = 2(j)p ■ ^ ^ 



(8) 
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Setting A = 1 yields the value of (f> at the percolation 
threshold Pc- 
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and solving for pc gives 
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For general fc, we have 



X^i.Lp\±1^2kc^p ^-}'-f ^ (12) 
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or, at the threshold 
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The threshold density pc is then a root of a polynomial 
of order k + 1. 



III. BOND PERCOLATION 

An alternative model of disease transmission is one in 
which all individuals are susceptible, but transmission 
takes place with less than 100% efficiency. This is equiv- 
alent to bond percolation on a small-world graph — an 
epidemic sets in when a sufficient fraction p^ of the bonds 
on the graph are occupied to cause the formation of a 
giant component whose size scales extensively with the 
size of the graph. In this model the fraction p of occupied 
bonds is the transmissibility of the disease. 

For fc = 1, the percolation threshold Pc for bond perco- 
lation is the same as for site percolation for the following 
reason. On the one hand, a local cluster of i sites now 
consists of i — 1 occupied bonds with two unoccupied 
ones at either end, so that the number of local clusters 
of i sites is 
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which has one less factor of p than in the site percolation 
case. On the other hand, the probability of a shortcut 
between two random sites now has an extra factor of p 
in it — it is equal to ipp instead of just ■0. The two factors 
of p cancel and we end up with the same expression for 
the eigenvalue of M as before, Eq. (H), and the same 
threshold density, Eq. (|ll|). 

For fc > 1, calculating Ni is considerably more difficult. 
We will solve the case k = 2. Let Qi denote the proba- 
bility that a given site n and the site n — 1 to its left are 
part of the same local cluster of size i, when only bonds 
to the left of site n are taken into account. Similarly, let 
Qij be the probability that n and n — 1 are part of two 
separate local clusters of size i and j respectively, again 
when only bonds to the left of n are considered. Then, 
by considering site n -I- 1 which has possible connections 
to both n and n — 1, we can show that 



^ _ , (1 -p)2[Qj +EfeQj7c] fori = , . 
^'+''^ ~ ^ p{l~p)Q,^ for I > 1, ^'^^ 



and 



Q,+i = p{2 - p) Q, +p{l-p)J2 Q^l +P'Y1 Qn' ■ 

j j+3'=i 

(16) 

If we define generating functions H{z) = QiZ^ and 
H{z,w) — QijZ^w^ , this gives us 

H{z, w) = z(l - pf [H{w) + H{w, 1)] 

+ zp{\-p)H{w,z), (17) 
H{z) = zp{2 - p)H{z) + zp{l - p)H{z, 1) 

+ zp^H{z,z). (18) 

Since any pair of adjacent sites must belong to some 
cluster or clusters (possibly of size one), the proba- 
bilities Qi and Qij must sum to unity according to 
J2^ Q^ + E^,J Q^J = 1> Or equivaleutly 7?(l)+i?(l, 1) = 1. 
Finally, the density of clusters of size i is equal to the 
probability that a randomly chosen site is the rightmost 
site of such a cluster, in which case neither of the two 
bonds to its right are occupied. Taken together, these 
results imply that the generating function for clusters, 
G{z) — J2i ^i^^, must satisfy 

G{z) = il-p)^[H{z) + H{z,l)]. (19) 

Solving Eqs. (|l|) and (|l|) for H{z) and H{z,l) then 
gives 

G(z) = [z(l - pf{l - 2pz + p3(l - z)z + p'z^)] I 
[1 - 4pz + p\2 - 3z)z'^ - / (I - z)z^ 
+ /z2(l + 3z) +p2;z(4 + 32) 
-p^z{l + 5z + z'^)], 

the first few terms of which give 

N,/L={l-p)\ 
N2/L^2p{\~pf, 
m/L^p^{l-p)''{6~8p + 3p^). 

Again replacing ip with ipp, Eq. (^ becomes 



(20) 



(21) 
(22) 
(23) 



(24) 



which, setting fc = 2, implies that the percolation thresh- 
old Pc is given by 



{1-Pcf{l-Pc+Pl) 



Apc (1 + 3p2 - 3p3 _ 2pi + 5p5 - 2p6) ■ 



(25) 



In theory it is possible to extend the same method to 
larger values of fc, but the calculation rapidly becomes 
tedious and so we will, for the moment at least, move on 
to other questions. 
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IV. SITE AND BOND PERCOLATION 

The most general disease propagation model of this 
type is one in which both the susceptibility and the trans- 
missibility take arbitrary values, i.e., the case in which 
sites and bonds are occupied with probabilities Psitc and 
Pbond respectively. For fc = 1 , a local cluster of size i then 
consists of i susceptible individuals with i — 1 occupied 
bonds between them, so that 



Replacing ip with ■(/'pb 



Psitc Pbond ) Psito -Pbond 

ond in Eq. 
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gives 
l+p 



(26) 



(27) 



where p = Psito Pbond- In other words, the position of 
the percolation transition is given by precisely the same 
expression as before except that p is now the product 
of the site and bond probabilities. The critical value 
of this product is then given by Eq. (pi]). The case of 
A: > 1 we leave as an open problem for the interested 
(and courageous) reader. 



V. NUMERICAL CALCULATIONS 

We have performed extensive computer simulations of 
percolation and disease spreading on small-world net- 
works, both as a check on our analytic results, and to 
investigate further the behavior of the models. First, we 
have measured the position of the percolation threshold 
for both site and bond percolation for comparison with 
our analytic results. A naive algorithm for doing this fills 
in either the sites or the bonds of the lattice one by one 
in some random order and calculates at each step the size 
of either the average or the largest cluster of connected 
sites. The position of the percolation threshold can then 
be estimated from the point at which the derivative of 
this size with respect to the number of occupied sites or 
bonds takes its maximum value. Since there are 0(L) 
sites or bonds on the network in total and finding the 
clusters takes time 0(L), such an algorithm runs in time 
O(L^). A more efficient way to perform the calculation 
is, rather than recreating all clusters at each step in the 
algorithm, to calculate the new clusters from the ones at 
the previous step. By using a tree-like data structure to 
store the clusters [ pT| , one can in this way reduce the time 
needed to find the value of pc to O(Llogi). In Fig. || 
we show numerical results for pc from calculations of the 
largest cluster size using this method for systems of one 
million sites with various values of fc, for both bond and 
site percolation. As the figure shows, the results agree 
well with our analytic expressions for the same quanti- 
ties over several orders of magnitude in 0. 

Direct confirmation that the percolation point in these 
models does indeed correspond to the threshold at which 



1.0 



0.8 



0.6 - 



3 0.4 



0.2 



0.0 





1 




1 1 1 



10 10 10"" 10 10" 10" 

shortcut density (|) 



10" 



10 



FIG. 2. The points are numerical results for the percolation 
threshold as a function of shortcut density (j) for systems of 
size L — 10^. Left panel: site percolation with fc = 1 (circles), 
2 (squares), and 5 (triangles). Right panel: bond percolation 
with fc = 1 (circles) and 2 (squares). Each point is averaged 
over 1000 runs and the resulting statistical errors are smaller 
than the points. The solid lines are the analytic expressions 
for the same quantities, Eqs. (0, (^, and (||). The slight 
systematic tendency of the numerical results to overestimate 
the value of pc is a finite-size efi'ect which decreases both with 
increasing system size and with increasing ((> |l^ . 



epidemics first appear can also be obtained by numerical 
simulation. In Fig. ^ we show results for the number of 
new cases of a disease appearing as a function of time in 
simulations of the site-percolation model. (Very similar 
results are found in simulations of the bond-percolation 
model.) In these simulations we took fc = 5 and a value of 
(j) — O.Ol for the shortcut density, which implies, following 
Eq. (^, that epidemics should appear if the susceptibil- 
ity within the population exceeds Pc — 0.401. The curves 
in the figure are (from the bottom upwards) p — 0.40, 
0.45, 0.50, 0.55, and 0.60, and, as we can see, the num- 
ber of new cases of the disease for p = 0.40 shows only a 
small peak of activity (barely visible along the lower axis 
of the graph) before the disease fizzles out. Once we get 
above the percolation threshold (the upper four curves) 
a large number of cases appear, as expected, indicating 
the onset of epidemic behavior. In the simulations de- 
picted, epidemic disease outbreaks typically affected be- 
tween 50% and 100% of the susceptible individuals, com- 
pared with about 5% in the sub-critical case. There is 
also a significant tendency for epidemics to spread more 
quickly (and in the case of self-limiting diseases presum- 
ably also to die out sooner) in populations which have a 
higher susceptibility to the disease. This arises because 
in more susceptible populations there are more paths for 
the infection to take from an infected individual to an un- 
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FIG. 3. The number of new cases of a disease appearing 
as a function of time in simulations of the site-percolation 
model with L = lO*', fc = 5, and = 0.01. The top four 
curves are for p = 0.60, 0.55, 0.50, and 0.45, all of which are 
above the predicted percolation threshold of pc — 0.401 and 
show evidence of the occurrence of substantial epidemics. A 
fifth curve, for p = 0.40, is plotted but is virtually invisible 
next to the horizontal axis because even fractionally below 
the percolation threshold no epidemic behavior takes place. 
Each curve is averaged over 1000 repetitions of the simulation. 
Inset: the total percentage of the population infected as a 
function of time in the same simulations. 

infected one. The amount of time an epidemic takes to 
spread throughout the population is given by the average 
radius of (i.e., path length within) connected clusters of 
susceptible individuals, a quantity which has been stud- 
ied in Ref. §. 

In the inset of Fig. ^ we show the overall (i.e., inte- 
grated) percentage of the population affected by the dis- 
ease as a function of time in the same simulations. As the 
figure shows, this quantity takes a sigmoidal form simi- 
lar to that seen also in random-graph models , simple 
small- world disease models 0, and indeed in real- world 
data. 



VI. CONCLUSIONS 

We have derived exact analytic expressions for the 
percolation threshold on one-dimensional small-world 
graphs under both site and bond percolation. These 
results provide simple models for the onset of epidemic 
behavior in diseases for which either the susceptibility 
or the transmissibility is less than 100%. We have also 
looked briefly at the case of simultaneous site and bond 
percolation, in which both susceptibility and transmis- 
sibility can take arbitrary values. We have performed 
extensive numerical simulations of disease outbreaks in 



these models, confirming both the position of the perco- 
lation threshold and the fact that epidemics take place 
above this threshold only. 

Finally, we point out that the method used here can 
in principle give an exact result for the site or bond per- 
colation threshold on a small-world graph with any un- 
derlying lattice for which we can calculate the density of 
local clusters as a function of their size. If, for instance, 
one could enumerate lattice animals on lattices of two 
or more dimensions, then the exact percolation thresh- 
old for the corresponding small- world model would follow 
immediately. 
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